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We define a normalized impact factor suitable to assess in a simple way both the strength of 
scientific communities and the research influence of individuals. We define those ones with NIF > 
1 as being scientific leaders since they would infiuence their peers at least as much as they are 
influenced by them. The NIF is distinguished because (a) this has a clear and universal meaning 
and (b) this is robust against self-citation expedient. We show how a single lognormal function 
obtained from a simplified version of the NIF leads to a clear "radiography" of the corresponding 
scientific community. As an illustration, this is eventually applied to analyze a community derived 
from the list of outstanding referees recognized by the American Physical Society in 2008. 

PACS numbers: 02.50.-r,01.75.+m,89.65.-s 



One of the most challenging aspects to define compre- 
hensive scientometric indexes concerns the fact that this 
is not obvious in general how to take into account the 
field idiosyncrasies On the other hand, some agree- 
ment on what scientometric guiding criteria are useful 
is crucial to allow the proper agencies to formulate gen- 
eral policies and optimize the use of financial resources. 
As a consequence a continuous effort to improve the ac- 
tual parameters can be witnessed. In order to take into 
account the publishing and citation traditions of differ- 
ent areas to rank scientific journals some authors have 
suggested, for instance, alternatives to the journal im- 
pact factor (JIF) One such example was proposed 
by Bergstrom who reports the development of an algo- 
rithm which captures the percentage of the time that li- 
brary users spend with a given journal ^^Eigenf actor 
Scores and Article Influence Scores rank journals much 
as Google ranks websites" . (See also Refs. Q for previ- 
ous related work.) More recently Nicolaisen and Frand- 
sen have defined the reference return ratio (3R). The 3R 
exhibits a strong correlation with the JIF yet the "3R 
appears to correct for citation habits, citation dynamics, 
and composition of document types" [g| . In the same year 
Zitt and Small proposed the audience factor (AU) 6] as 
a way to normalize the standard JIF by the journal field. 

As part of the broad program of defining scientomet- 
ric parameters whose interpretation is as independent as 
possible from the research field, we define here the nor- 
malized impact factor (NIF) to assess the strength of sci- 
entific communities and the influence of individual re- 
search. The NIF is distinguished because (a) this has a 
clear and universal meaning being applicable with equal 
efficiency to individuals belonging to quite distinct com- 
munities and (b) this is robust against self-citation ex- 
pedient. For the sake of illustration, we use the NIF to 
analyze a community derived from the list of outstand- 
ing referees recognized by the American Physical Society 
(APS) in 2008 "7]. We compare the NIF with the ft.-index 
and show that no clear correlation between them is seen 
indicating that both indexes carry different pieces of in- 
formation. Eventually we discuss the NIF limitations. 



FIG. 1: The vertical bars show the population probability 
density obtained from our data. The area below the graph 
gives the rate of individuals with SNIF in the corresponding 
interval. The solid line is a lognormal distribution fit: f{x) = 

g-(in{^)-A.)2/(2<T2)^y27"aa; {x = SNIF) with fi = -0.3 and 
a = 0.6. Note that 68% of the population has 0.4 < SNIF < 
1.3. Leaders comprise only 31% of the population. 




FIG. 2: The bars express the /i- index range assumed by indi- 
viduals with fixed SNIF. No obvious relationship between the 
SNIF and /i-index is seen: we can find leaders with relatively 
small /i-index and individuals with SNIF < 1 with relatively 
large ft-index. 



The NIF puts in context (i) the infiuence exerted by the 
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research of an individual with respect to (ii) how much 
this individual has been influenced by his/her scientific 
community. Our assumption is that while citations re- 
ceived by an individual reflects (i) [1, Q the bibliographic 
references listed by him/her must reflect (ii). Hence, we 
define the NIF of an individual as 

NIF ^ (1) 

where Ci and are the number of citations received and 
references included in his/her ith paper which is signed 
by Oi authors, respectively. Thus, if the first paper of 
the publication list of some researcher is signed, say, by 
3 authors (the researcher himself/herself and two other 
collaborators), has 30 references and has received 20 cita- 
tions, then oi = 3, ri = 30 and ci = 20, and so on for the 
other papers. The inclusion of avoids double count- 
ing of references and citations in multi-authored papers 
leading to the following important feature: in a closed 
community of identical individuals (i.e. who publish, ref- 
erence and are cited by each other at the same rate) all 
members have NIF = 1. This is so because each refer- 
ence corresponds to a citation. The NIF interpretation 
is, thus, straightforward. We denominate leaders individ- 
uals with NIF > 1 because they influence their peers at 
least as much as they are influenced by them. The exis- 
tence of an universal reference value (NIF = 1) is one of 
the most satisfying aspects of this factor. Other ones are 
much field dependent and, thus, more difficult to inter- 
pret. Moreover, this is not easy to artificially boost one's 
NIF through self citation because the inclusion of extra 
references increases the denominator of Eq. ([T]). How- 
ever, we note that the NIF is obviously silent about the 
intrinsic relevance of the research work itself. We might 
have leaders (in the sense that they possess NIF > 1) 
influencing people to work on issues which eventually do 
not pay off their research effort and vice versa. The NIF 
is clearly unable to evaluate the "leadership quality" . 

As an illustration, we analyze a community derived 
from the 2008 hst of APS outstanding referees 0|- The 
methodology used is as follows. We have randomly se- 
lected 283 individuals (among the 531 ones) and looked 
for their publication lists in the ISI Web of Science p^ . 
To avoid including homonyms, we have entered with 
the researcher name and affiliation exactly as appears in 
Ref. 0], and recorded the ones with 20 or more papers. 
The derived community suffices for our purposes. Even- 
tually 223 individuals (authoring 22,611 papers) passed 
the criterion above, comprising 592,429 references and 
597,390 citations. Incidentally the total number of refer- 
ences is almost the same as the total number of citations, 
which is a necessary (although not sufficient) requirement 
for (approximately) closed communities. Obviously, our 
223 individuals do not form a closed community. Because 
of resource limitations we have actually carried our anal- 
ysis using the simplified normalized impact factor (SNIF) 



defined as 

SNIF=^^. (2) 

The typical difference between the SNIF and the NIF for 
10 randomly chosen individuals was less than 10%. This 
simplified procedure has restricted our data collection to 
3 months and final update to 4 days [from June, 21st 
to 24th (2008)]. In Fig. [1] the vertical bars show the 
population probability density generated from our data. 
The area below the graph gives the rate of individuals 
with SNIF in the corresponding interval. Interestingly 
enough, Fig. [T] is well fit by a lognormal distribution : 

fix) = e-(i"(-)-^)'/(2.^VV2^<^^, 

where x = SNIF, and /i ~ —0.3, a ~ 0.6 for the com- 
munity under analysis. As a matter of fact, fi and a 
will depend on the community but f{x) is always unit 
normalized: 

/•OO 

/ f{x)dx = 1. 
Jo 

We can see fi and a as being the "community fingerprint" . 
The fact that our sample comprises from high- energy ex- 
perimentalists to condensed- matter theoreticians and our 
data are fit by a single lognormal distribution supports 
our hope that the (S)NIF can be successfully applied to 
quite distinct populations keeping its simple and universal 
interpretation. We see that 68% of the population will 
be found between exp(/i — cr) « 0.4 and exp(/i-|-cr) ~ 1.3. 
We obtain, thus, from Fig. [T] that 68% of our population 
has 

0.4 < SNIF < 1.3. 

The median which separates the population in two halves 
with respect to the SNIF value is given by 

SNIF EE exp(^) = 0.74. 

Thus, half of the population has SNIF > 0.74. The SNIF 
expected value (or mean value) is given by 

/"OO 

£;(SNIF) = / dxxfix) = e^+'''/2 ^ Q gg^ 

Note that £'(SNIF) > SNIF. It can be numerically cal- 
culated from Fig.{J\that leaders comprise only 31% of the 
population. (We shall comment that the present author 
does not belong to the leader set.) 

Up to this point we have used the (S)NIF to put in 
context the scientific work of individuals with respect 
to their peers. Now, we go further and propose a way 
to assess the strength and homogeneity of the scientific 
community itself. This is possible because of the nice fit 
provided by the lognormal function for the SNIF distri- 
bution. Let us notice that the larger the median, exp(/i). 
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is the more "stretched" towards large NIFs Fig.[T]win be. 
This suggests the following definition for the community 
strength: 

CS — exp(/i). 

Similarly, the smaller the a is, the stiffer the graph will 
be, in which case most people would have the same corre- 
sponding NIF and the community would be considerably 
homogeneous. A prototype of a homogeneous community 
would be the one described in the beginning, namely, 
a closed community of identical individuals having all 
members NIF = 1. This suggests the following relation 
as a measure of the community homogeneity: 

CH = 1/a. 

By the same token, the larger the a is the flatter the 
distribution will be, which would characterize a nonho- 
mogeneous community consisting of members with quite 
distinct NIF values. 

Now, for the sake of comparison we show in Fig. [2] 
the h-index 12 1 range (see also Ref. [3, [13]) assumed by 
individuals in a fixed SNIF interval. No obvious relation- 
ship between "large" /i-index and scientific leadership in 
the sense defined above is seen: we find leaders with rela- 
tively small h-'mdex and nonleaders with relatively large 
/i-index. Notice that in general a correlation between the 
/i- index and the number of papers is expected to exist. 
Let us consider for a moment the extreme case where 
every papers of some individual cite all his/her previous 
ones. In this case, the corresponding h-index would be 
N/2 and {N — l)/2 for N even and odd, respectively, 
only due to self citation, where N is the total number of 
papers. This can lead to quite large values. Although 
this is an unrealistic case, it illustrates the possible influ- 
ence of the paper number on the h-index. This is not so 
with the NIF which is basically insensitive to the paper 
number. The NIF is not expected to exhibit more than a 
moderate correlation with the h-index. As a result, the 
pieces of information carried by the NIF and h-index are 
fairly distinct. 

In summary, assuming that our hypotheses (i) and (ii) 
above are meaningful, it seems fair to support that the 
NIF provides useful and novel information about the sci- 
entific work of individuals and of their corresponding 
communities. However, we would like to make some cau- 
tion remarks concerning the use of this index. (The same 
remarks apply to the SNIF.) The first one, is that the 
NIF should be used to assess senior rather than junior 
scientists. How to define seniority is an open question 
on which some agreement has to be reach. The NIF of 
junior researchers will exhibit in general a strong tran- 
sient profile as a function of time and will be mostly 
infiuenced by the supervisor expertize in which case the 
"leader" label would not apply. Moreover, the NIF does 
not distinguish between protagonist and coadjuvant au- 
thors cosigning the same papers. The reliability of the 
NIF as a way to assess leadership will depend on how 



much one's publication list does reflect the researcher 
own expertize rather than the one of his/her collabo- 
rators. In addition, researchers who solve a controversy 
may be less cited than the ones who have raised it. De- 
pending on the case, this may induce the NIF of the for- 
mer ones to be smaller than the NIF of the latter ones. In 
order to deal with these cases, other aspects (mostly sub- 
jective ones) should be taken into account [1^]. Next, the 
NIF is robust against self-citation expedient but not com- 
pletely immune to it. Self citations tend to increase and 
decrease the NIF value for individuals with NIF < 1 and 
NIF > 1, respectively. If we assume the usual case where 
the number of self citations (NSC) is small in comparison 
with the total number of citations (TNC) received by a 
researcher, NSC/TNC < 1, it is easy to get from Eq. ^ 
that nonleaders will have the NIF value increased due to 
self citation by a factor of order 1 -I- NSC/TNC w 1 hav- 
ing, thus, (some but) not a dramatic net influence on the 
final NIF value. The NIF can be also infiuenced depend- 
ing on the individual care and generosity when acknowl- 
edging the infiuence of others by including more or less 
references. However, we do not expect it to have a huge 
infiuence either. Any unusual practice or artificial ten- 
tative to boost one's NIF by omitting crucial references 
should face prompt opposition from the peer-review sys- 
tem: whenever this is possible and necessary referees help 
authors to improve their papers by calling attention to 
related references. On the other hand, the NIF adop- 
tion would probably drive authors not to include irrele- 
vant references with the aim of receiving extra citations 
as a counterpart, which we see as being a "purifying" 
feature. Some concern can be also raised about how the 
presence of reviews in the publication list could affect the 
NIF. Although reviews tend to include long reference lists 
reflecting the contribution of various individuals to the 
corresponding topic, they also tend to be heavily cited 
because of its comprehensiveness. Eventually both ten- 
dencies should compensate each other having no major 
effect in the NIF of senior scientists. 

Despite the NIF robustness, obviously no single pa- 
rameter is able to clear-cut leaders from nonleaders. In 
spite of the possible difficulties which the NIF may face 
when applied to assess individuals, we would like to em- 
phasize that our main motivation to introduce this index 
was to provide a tool to produce more accurate "radio- 
graphics" of research communities as a whole (in which 
case individual idiosyncrasies are "averaged" and should 
not have any relevant net influence in the final diagnosis) 
leading thus to more efficient scientific policies. This is 
particularly important to communities in emergent coun- 
tries which are still fixing their standards and aim to have 
eventually a protagonist role in the global scenario. 
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